Towards Sophisticated Wrapping of Web-based information Repositories

نویسندگان

  • Boris Chidlovskii
  • Uwe M. Borghoff
  • Pierre-Yves Chevalier
چکیده

Access to on-line information via the Web is exploding. Index and retrieval engines already start to integrate a huge variety of heterogeneous repositories. However, the heterogeneity issue remains, both in terms of the search formats and the formats of the result pages. In this paper we focus on html-based search and result presentations. We discuss our experience in the design, the development and the maintenance of wrappers (in the context of the Knowledge Broker project). We outline different ways to write wrappers, illustrate some of the lessons learned, and conclude by describing a semi-automatic approach for an efficient wrapping of Web-based information repositories. Throughout the paper, we give illustrating examples for hands-on readers.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards Contextualized Rule Repositories for the Semantic Web

Central to the semantic web are ontologies: shared conceptualizations of domains of interest expressed in an ontology language such as OWL. Rule languages complement ontology languages. For large heterogeneous bodies of knowledge on the semantic web, contextualized knowledge repositories facilitate the organization of ontological concepts. In this paper we propose a similar mechanism – contextu...

متن کامل

Taxonomy-Based Web Service Categorization Using Conceptual Parameter Descriptions

With the envisioned proliferation of Web services available on the WWW and private repositories, new and better support techniques are needed for service discovery and organization to stay manageable. Service classification under hierarchic taxonomies is commonly a key feature for properly organizing service repositories in a rational way, as well as a good foundation for sophisticated retrieva...

متن کامل

Discovering Services: Towards High-Precision Service Retrieval

The ability to rapidly locate useful on-line services (e.g. software applications, software components), as opposed to simply useful documents, is becoming increasingly critical in many domains. Current service retrieval technology is, however, notoriously prone to low precision. This paper describes a novel service retrieval approached based on the sophisticated use of process ontologies. Our ...

متن کامل

Towards Supporting Exploratory Search over the Arabic Web Content: The Case of ArabXplore

Due to the huge amount of data published on the Web, the Web search process has become more difficult, and it is sometimes hard to get the expected results, especially when the users are less certain about their information needs. Several efforts have been proposed to support exploratory search on the web by using query expansion, faceted search, or supplementary information extracted from exte...

متن کامل

Discovering services: Towards High-Precision Service Retrieval1

The ability to rapidly locate useful on-line services (e.g. software applications, software components), as opposed to simply useful documents, is becoming increasingly critical in many domains. Current service retrieval technology is, however, notoriously prone to low precision. This paper describes a novel service retrieval approached based on the sophisticated use of process ontologies. Our ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997